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Abstract. Zielonka's theorem shows that each regular set of Mazurkiewicz traces 
can be implemented as a system of synchronized processes with a distributed 
control structure called asynchronous automaton. This paper gives a polynomial 
algorithm for the synthesis of a non-deterministic asynchronous automaton from 
a regular Mazurkiewicz trace language. This new construction is based on an 
unfolding approach that improves the complexity of Zielonka's and Pighizzini's 
techniques in terms of the number of states. 
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Introduction 

One of the major contributions in the theory of Mazurkiewicz traces [3] characterizes 
regular languages by means of asynchronous automata [15] which are devices with a 
distributed control structure. So far all known constructions of asynchronous automata 
from regular trace languages are quite involved and yield an exponential explosion of 
the number of states [9]. Furthermore conversions of non-deterministic asynchronous 
automata into deterministic ones rely on Zielonka's time-stamping function [6, 10] and 
suffer from the same state-explosion problem. Interestingly heuristics to build small 
deterministic asynchronous automata were proposed recently in [13]. 

Zielonka's theorem and related techniques are fundamental tools in concurrency 
theory. For instance they are useful to compare the expressive power of classical models 
of concurrency such as Petri nets, asynchronous systems, and concurrent automata [14, 
7]. These methods have been adapted already to the construction of communicating 
finite-state machines from regular sets of message sequence charts [8]. More recently 
the construction of asynchronous cellular automata [2] was used to implement globally- 
cooperative high-level message sequence charts [5]. All these constructions yield an 
exponential explosion of the number of local states. 

In this paper we give a polynomial construction of non-deterministic asynchronous 
automata. Our algorithm starts from the specification of a regular trace language in 
the form of a possibly non-deterministic automaton. The latter is unfolded inductively 
on the alphabet into an automaton that enjoys several structural properties (Section 2). 
Next the unfolding automaton is used as the common skeleton of all local processes 
(Subsection 3.2). Our algorithm is designed specifically to ensure that the number of 
local states built is polynomial in the number of global states in the specification (Sub- 
section 3.1). We show how this approach subsumes the complexity of Zielonka's and 
Pighizzini's constructions (Subsection 1.3). 
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1 Background and main result 

In this paper we fix a finite alphabet E provided with a total order C. An automaton 
over a subset T C E is a structure .A = (Q, i, T, — F) where Q is a ^zw/e set of 
states, i G Q is an initial state, — >C Q x T x Q is a set of transitions, and F C Q is a 
subset of final states. We write g — ► g' to denote (g, a, q') G — Then the automaton 
.A is called deterministic if we have q — g' A g — g" q' = q" ■ For any word 
u = ai...a„ G we write g — ^ g' if there are some states go, 5i, ■•■7 <Zn G Q such 
that q = qo q\...q n -i — ^ q n = q' . A state g G Q is reachable if z -^-> g for some 
m G Z 1 *. The language L(.A) accepted by some automaton A consists of all words 
u G S* such that 1 -^-> g for some q E F. A subset of words i C S* is regular if it is 
accepted by some automaton. 

1.1 Mazurkiewicz traces 

We fix an independence relation \\ over S, that is, a binary relation || C £ x £ which 
is irreflexive and symmetric. For any subset of actions T C Z 1 , the dependence graph 
of T is the undirected graph (V, S) whose set of vertices is V = T and whose edges 
denote dependence, i.e. {a, b} G E <^=> aj[6. 

The frace equivalence ~ associated with the independence alphabet (X 1 , ||) is the 
least congruence over Z 1 * such that a& ~ 6a for all pairs of independent actions a\\b. 
For a word u G Z 1 *, the trace [u] = {v G S* \ v ~ w} collects all words that are 
equivalent to u. We extend this notation from words to sets of words in a natural way: 
For all L C Z 1 *, we put [L] = {v G Z 1 * | 3u G L, w ~ w}. 

A frace language is a subset of words L C S* that is closed for trace equivalence: 
ueLAv^u^-v^L. Equivalently we require that L = [L\. With no surprise a 
trace language L is called regular if it is accepted by some automaton. 

1.2 Asynchronous systems vs. asynchronous automata 

Two classical automata-based models are known to correspond to regular trace lan- 
guages. Let us first recall the basic notion of asynchronous systems [1]. 

DEFINITION 1.1. An automaton A = (Q,i,£, — F) over the alphabet £ is called 
an asynchronous system over (£, ||) if we have 

ID; qi -—> q 2 A q 2 — —> A a\\b implies q\ — —> g 4 A q^ -—> q 3 for some q^ G Q. 

The Independent Diamond property ID ensures that the language L(A) of any asyn- 
chronous system is closed for the commutation of independent adjacent actions. Thus 
it is a regular trace language. Conversely it is easy to observe that any regular trace 
language is the language of some deterministic asynchronous system. 

We recall now a more involved model of communicating processes known as asyn- 
chronous automata [15]. A finite family S = (Uk) keK of subsets of E is called a distri- 
bution of(E, ||) if we have ajfb 3k G K, {a, b} C Ek for all actions a, b G E. Note 
that each subset Ek is a clique of the dependence graph (E, Jf) and a distribution 8 is 
simply a clique covering of (E, We fix an arbitrary distribution S = (Ek) keK in the 
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rest of this paper. We call processes the elements of K. The location Loc(a) of an action 
a £ £ consists of all processes k £ K such that a £ Sk- Loc(a) = {k £ K \ a £ Ek}. 

DEFINITION 1 .2. An asynchronous automaton over the distribution {E k ) keK consists 
of a family of finite sets of states (Qk) keK , a family of initial local states {ik) keK 
with ik £ Qk, a subset affinal global states F C J} fceA > Qk, and a transition relation 
°a Q ILeLoc(a) Qk X ILeLoc(a) Qk for each action a £ E. 

The set of global states Q = Yl keK Qk can be provided with a set of global transitions 
— ► in such a way that an asynchronous automaton is viewed as a particular automaton. 
Given an action a £ E and two global states q = (qk) ke x an d r = ( r k) ke K> we P ut 
q r if ((q k ) keLoc{a) , (r k ) keLoc{a) ) e d a and q k = r k for all k £ K \ Loc(a). 
The initial global state i consists of the collection of initial local states: i = {ik) ke K- 
Then the global automaton A = (Q, i, E, — >, F) satisfies Property ID of Def. 1.1. 
Thus it is an asynchronous system over (£, ||) and L{A) is a regular trace language. An 
asynchronous automaton is deterministic if its global automaton is deterministic, i.e. 
the local transition relations d a are partial functions. 



1.3 Main result and comparisons to related works 

Although deterministic asynchronous automata appear as a restricted subclass of deter- 
ministic asynchronous systems, Zielonka's theorem asserts that any regular trace lan- 
guage can be implemented in the form of a deterministic asynchronous automaton. 

THEOREM 1.3. [15] For any regular trace language L there exists a deterministic 
asynchronous automaton whose global automaton A satisfies L — L(A). 

In [9] a complexity analysis of Zielonka's construction is detailed. Let \Q \ be the num- 
ber of states of the minimal deterministic automaton that accepts L and \K\ be the 
number of processes. Then the number of local states built by Zielonka's technique in 
each process k £ K is \Q k \ < 2 0( - 2lK> log The simplified construction by Cori 
et al. in [2] also suffers from this exponential state-explosion [3]. 

Another construction proposed by Pighizzini [12] builds some non-deterministic 
asynchronous automata from particular rational expressions that refine Ochmanski's 
theorem [11]. This simpler approach proceeds inductively on the structure of the ra- 
tional expression. Each step can easily be shown to be polynomial. In particular the 
number of local states in each process is (at least) doubled by each restricted iteration. 
Consequently in some cases the number of local states in each process is exponential in 
the length of the rational expression. 

In the present paper we give a new construction that is polynomial in \Q\ (Th. 3.1): 
It produces \Q k \ ^ 0(\Q\ d ) local states for each process, where d = {2.\E\+2)\ S \ +1 , 
| S | is the size of E, and | Q \ is the number of states of some (possibly non-deterministic) 
asynchronous system that accepts L. Noteworthy the number of local states \Q k \ ob- 
tained by our approach is independent from the number of processes \K\. 
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2 Unfolding algorithm 

In the rest of the paper we fix some automaton A = (Q,i,£, — F) that is possibly 
non-deterministic. The aim of this section is to associate with A a family of automata 
called boxes and triangles which are defined inductively. The last box built by this 
construction will be called the unfolding of A (Def. 2.3). 

Boxes and triangles are related to A by means of morphisms which are defined 
as follows. Let Ai = (Qi,i\,T, — and Ai — (Q2,i2,T, — > 2 ,F 2 ) be two 
automata over a subset of actions T C S. A morphism a : A\ — > A2 from A\ to A2 
is a mapping a : Q\ — > Q 2 from Q\ to Q2 such that a(i\) — 12, &{Fi) C F 2 , and 
qi -^1 q[ implies a(qi) -°^2 &(q'i)- In particular, L(Ai) C L(yi 2 ). 

Now boxes and triangles are associated with an initial state that may not correspond 
to the initial state of A. They are associated also with a subset of actions T C E. For 
these reasons, for any state q e Q and any subset of actions T C 17, we let Ax. q denote 
the automaton (Q, q, T, — > T , F) where — > T is the restriction of — ► to the transitions 
labeled by actions in T: — > T = — > f](Q xT x Q). 

In this section we shall define the box D^q for all states q G Q and all subsets of 
actions T C E. The box ^ T q is a pair (23t, 9 , /^t,,) where By,, is an automaton over 
T and (3r, q ■ ^>T, q — > -At,, is a morphism. Similarly, we shall define the triangle At,, 
for all states q and all non-empty subsets of transitions T. The triangle At., is a pair 
i~T, q ) where Tr, q is an automaton over T and tt, q ■ 7r,q — > A-T,q is a morphism. 

The height of a box Cl T q or a triangle At,, is the cardinality of T. Boxes and 
triangles are defined inductively. We first define the box Dj^ for all states q E Q. Then 
triangles of height h are built upon boxes of height g < h and boxes of height h are 
built upon either triangles of height h or boxes of height g < h, whether the dependence 
graph (T, j[) is connected or not. 

The base case deals with the boxes of height 0. For all states q 6 Q, the box n^^ q 
consists of the morphism /?0 ;9 : {q} — > Q that maps q to itself together with the au- 
tomaton ®0 i9 = ({<?}, q, 0, 0, F®^) where Fq^ = {q} if q e F and Fq^ = otherwise. 
More generally a state of a box or a triangle is final if it is associated with a final state 
of A. 

2.1 Building triangles from boxes 

Triangles are made of boxes of lower height. Boxes are inserted into a triangle induc- 
tively on the height along a tree-like structure and several copies of the same box may 
appear within a triangle. We want to keep track of this structure in order to prove proper- 
ties of triangles (and boxes) inductively. This enables us also to allow for the distinction 
of different copies of the same box within a triangle. 

To do this, each state of a triangle is associated with a rank k e N such that 
all states with the same rank come from the same copy of the same box. It is also 
important to keep track of the height each state comes from, because boxes of a tri- 
angle are included inductively on the height. For these reasons, a state of a triangle 
At°., = {Fr°. q ° , TT°, q °) is encoded as a quadruple v = (w, T, q, k) such that w is a 
state from the box T q with height h = \T\ and v is added to the triangle within the 
fc-th box inserted into the triangle. Moreover this box is a copy of T q . In that case 
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Build-Triangle(T°, q°) 

1 (23,/?) <— BUILD-BOX(0, <f ) 

2 (T,t) ^Mark( (23, /?),0,g°,l) 

3 fc <- 1 

4 for/i<- lto|T°| -1 

5 do for v = (w, T, q, I) a state of T with \T\ = h - 1 



6 do for q' € Q and a € T° \ T 

7 do if /3t,,H -W 

8 then rVru {a} Here |T'| = ft < |T°| 

9 (23, /3) <- BuiLD-Box(T',g') Compute n T , >q , 

10 k^k + 1 

11 <-MARK((S,/3),T',g',fc) Mark it with T' , q' , k 

12 Insert((T,t), Insert it into (T, r) 

13 ADD((T, t), {v, a, (i n ,T>, q >,T', q' , k))) 



14 return (T, r) 

N.B. Line 12, ia,T',q' denote the initial state of the box T , q , . 

Alg. 1. Construction of a triangle 

the state v maps to Tr°, q °(v) — pT,q(w), that is, the insertion of boxes preserves the 
correspondance to the states of A. Moreover the morphism tt° , 9 ° of a triangle At» i9 o 
is encoded in the data structure of its states. 

The construction of the triangle At<\ 9 ° is detailed in Algorithm 1. It relies on four 
procedures: 

- BuiLD-Box(T, q) returns the box n T q . 

- Mark((23, /?), T, q, k) returns a copy of (23, (3) where each state w from 25 is re- 
placed by the marked state v — (w, T, q, k). 

- Insert((T, r), (23, (3)) inserts (23, (3) within (T, r); the initial state of this disjoint 
union of automata is the initial state of (T, r). 

- Add((T, t), (v, a, v')) adds a new transition v v' to the automaton T; it is 
required that v and v 1 be states of T. 

The construction of the triangle Ap i? « starts with building a copy of the base box 
□g^o which gets rank k = 1 and whose marked initial state (in,Q, q ° , 0, 9°, 1) becomes 
the initial state of At°,«j°- Along the construction of this triangle, k counts the number 
of boxes already inserted in the triangle. The insertion of boxes proceeds inductively on 
the height h (Line 4) as follows: For each state v = (w, T, q, I) with height \T\ = h — 1, 
if a transition f3r, q (w) q' in A carries an action a eT°\T (Line 6) then a new box 
□t',,' of height h is inserted with T 1 = T U {a} (Line 12) and a transition v u' 
is added to the triangle T^t° ,q° in construction (Line 13) where v' is the marked initial 
state of the new box T , q ,. We stress here that t(v) t(v') is a transition of Ar°, q ° 
because t(v) = [3T, q {w) andr(w') = I3t> , q '{iu,T> , q >) — q 1 ■ This observation will show 
that r is a morphism. Another useful remark is the following. 

LEMMA 2.1. If a word u E S* leads in the triangle , q ° from its initial state 
(*n,0,<j° , 0, <Z° , 1) to some state v — (w, T, q, I) then each action of T occurs in u. 
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2.2 Building boxes from triangles 

We distinguish two cases when we build the box D^g whether the dependence graph 
(T, JO is connected or not. In case (T, Jf) is a connected graph then the box D Tq col- 
lects all triangles At, 9 ' for all states q' e Q. Each triangle is duplicated a fixed number 
of times and copies of triangles are connected in some particular way. Similarly to tri- 
angles, the states of a box are decorated with a rank k that distinguishes states from 
different triangles and also states from different copies of the same triangle. We adopt 
the same data structure as for triangles: A state v of a box is a quadruple (w, T, q, k) 
where w is a state of A^, and k G N. Whereas triangles of height h are built upon boxes 
of height g < h, boxes T q are built upon triangles At,«j' with the same set of transi- 
tions T — and consequently, with the same height. Similarly to the algorithm BuiLD- 
TRIANGLE, the algorithm that builds boxes uses an integer variable k that counts the 
number of triangles already inserted in the box in construction. 

In case the dependence graph (T, J[) is not connected, we let T\ denote the con- 
nected component of (T, Jf) that contains the least action a e T w.r.t. the total order C 
over S and we put T2 = T \T\. Then the box T ^ q is built upon a copy of the box 
□t 2 , 9 connected to copies of boxes TliQl for some states q\ e Q. 

The construction of the box T o, q o is detailed in Algorithm 5. It relies on ten pro- 
cedures: 

- BASE-Box(g) returns the base box D q . 

- Empty-Box returns a special new box called empty box. 

- Mark, Insert and Add are the procedures used for Build-Triangle. If (23, (3) 
is this special empty box then Insert((23, /?), (T, r)) replaces simply (23,/?) by 
(T,r). 

- Missing(T°, q, q') returns the set of all pairs (w, a) where w is a state that has 
been inserted in the triangle At°, 9 within a box d T „ tq „ such that \T"\ = \T°\ — 1 
and the action a e T° \ T" is such that there is a transition tt°^{w) q' in A 
(Alg. 2). Due to the structure of triangles, if (w, a) is a missing transition then there 
is no transition w -^A,T°,g w' with TT°, q {w') — q' in At° i(? . 

- Min-Rank(T°, q, 23, k) returns the minimal rank of a copy of a triangle Tr°, q 
inserted in 23 where k is the maximal rank of triangles in 23 (Alg. 3). 

- Max-Out-Degree(T°) returns the number of copies of each triangle A T °, g that 
should compose the box D T o 9 c It does not depend on q but it depends on the 

MlSSING(T°,g,q') 

1 M <- 

2 (T, r) <- BUILD-TRIANGLE(T , q) 

3 for w G Q A ,T°, q such that w = (w" ,T" ,q" ,k") and \T"\ = |T°| - 1 

4 do if T T °, q (w) q' with a G T° \T" 

5 thenM^Mu{(ti),a)} 

6 return (M) 

N.B. The triangle A T °, q = (T, r) computed at Line 2 consists of a set of 
states Qa,t«, 9 and a transition relation — >A,T°, g - 

Alg. 2. Set of missing transitions from a triangle At° , q to some state q 



Polynomial Synthesis of Asynchronous Automata 



7 



Min-Rank(T°, q, S, k) 

1 / <- fc + 1 

2 for w = (w,T',g',Z) inS 

3 do if q' = g and T' = T° 

4 then if/ </ 

5 then / = I 

6 return (/) 

Alg. 3. Minimal rank of A T ° , q in 23 



Max-Out-Degree(T°) 

1 m <- 

2 for q,q £Q 

3 don<- |MISSING(T°, g, 

4 if n > m 

5 then m <— n 

6 return (m) 

Alg. 4. Minimal number of copies required 



cardinality of all sets Missing(T°, q, q 1 ) with q,q' eQ (Alg. 4). The role of these 
copies is detailed below. 

- Clean(23, (3) remove all unreachable states from 23. 

- Decomposition(T ) returns the connected component T of (T°, Jf) that contains 
the minimal action of T° w.r.t. the total order C. 

The construction of the box T o q0 starts with solving the base case where T° = 
(Line 1). Assume now that the dependence graph (T°, Jf) is connected (Line 4). Then 
the box is initialized as the special empty box (Line 6). The number m of copies of each 
triangle Aj. >q is computed in Line 8 with the help of functions Max-Out-Degree 
and MISSING. Next these copies are inserted and the first copy of Ajo ^ gets rank k = 
1 (Lines 9 to 14). Consequently the initial state of the box U TOq o in construction is the 
first copy of the initial state iA,T°, q ° of the triangle AT°, g °, that is: (iA,T°, q ° ,T°,q°,l). 
Noteworthy copies of the same triangle have consecutive ranks. 

In a second step transitions are added to connect these triangles to each other (Lines 
15 to 25). Intuitively a a-transition is missing from the state w — (w", T", q", k") of 
the triangle At° , q to the state q' of A if \T° \T"\ = 1 — i.e. this state has been inserted 
at the highest level in Ato^ — and there exists in A a transition TT°, q (w) q' with 
a eT° \ T" but no transition w w' with tt°^{w') = q' in Aro^. 

The role of MISSING is to compute the missing transitions w.r.t. q, q', and T°. For 
each such missing transition (w, a) we connect each copy of w to the initial state of a 
copy of Ax°, 9 '. In this process we require two crucial properties: 
Pi: No added transition connects two states from the same copy of the same triangle: 

(w, T°, q, I) should not be connected to (iA.T°.q, T°, q, I). 
P2'. At most one transition connects one copy of At» j9 to one copy of At°,<j': If we 
add from a given copy of A T o >q a transition (w\,T°, q, I) (^A,T o ,g' , T°, q',l') 

and a transition (w2,T°, q, I) — — > (iA,T°,q>,T°,q', I') to the same copy of At»,,' 
then w\ = u>2 and a = b. 

The minimal number of copies required to fulfill these conditions is computed by 
Max-Out-Degree. For a fixed missing transition (w, a) from a state w of the triangle 
At° , q to a state q 1 of A, Lines 22 to 25 add a transition from the 7-th copy of w to the c- 
th copy of the initial state of Ay» q , with the property that j ^ c if q = q' (Condition Pi 
above). Moreover states from the j-th copy of Ar, g are connected to distinct copies of 
the initial state of /\r°,q' (Condition P2 above). 

Note here that each new transition (v,a,v') added to (23, [3) at Line 25 is such 
that p(v) (3(v') is a transition from AT°, q ° because j3{v) = TT°, q {w), (3(v') = 
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TT°,q'(tA,T°,q') = q' , and tt° , q (w) —* q' . Again, this observation will show that (3 is 
a morphism. A crucial remark for boxes of connected alphabets is the following. 

LEMMA 2.2. If a non-empty word u leads from the initial state of a triangle A^., 
to the initial state of a triangle At°,,' within the box T o q o then each action ofT° 
occurs in u. 

For simplicity's sake our algorithm uses the same number of copies for each trian- 
gle. This approach yields in general unreachable states in useless copies. The latter are 
removed by Clean at Line 26. 

Assume now that (T°, }f) is not connected (Line 29). Let T\ be the connected com- 
ponent of T° that contains the least action of T° w.r.t. the total order IZ over S. We put 
T 2 = T° \ T\. The construction of the box Dt ,, starts with building a copy of the 
box T . 2 q o. Next for each state w of ^T 2 ,q° an d eacn transition /?t 2 ,,( w ) l' with 
a G T\, the algorithm inserts a (new) copy of the box Dti,,' and adds a transition from 
the copy of w to the initial state of the copy of n Tl . By recursive calls of BuiLD-Box 
the box T0<q is built along a tree-like structure upon copies of boxes T , <q , where T' 
is a connected component of T°. 

2.3 Remarks 

From a mathematical viewpoint, Algorithms 1 to 5 are meant to define boxes IHt,, 
and triangles A T ,,. Thus two instances of Build-Triangle(T, q) produce the same 
object. For this reason, we speak of the triangle At,,. This is particularly important to 
understand the interaction between BuiLD-Box and MISSING. In case T is connected, 
Algorithm BuiLD-Box proceeds in two steps. First several copies of each triangle At,, 
are collected and next some transitions are added from some states of copies of At,, 
to the initial state of copies of At,,' ■ These additional transitions are computed in a 
separate function MISSING that depends on triangles. It is crucial that the triangles At,, 
used by the function MISSING be the same as the triangles At,, inserted in BuiLD- 
Box. 

From a more computational viewpoint, Algorithms 1 to 5 can obviously be imple- 
mented. To do this, we require that each triangle and each box be constructed only once. 
An alternative to this requirement is to adapt the parameters of the function MISSING 
and ensure that BuiLD-Box transfers its own triangle At,, instead of the pair (T, q) 
to that function so that the set of states computed by MISSING matches the set of states 
used by BuiLD-Box. However it need not to transfert its own triangle At,, to the func- 
tion Max-Out-Degree because this function works on triangles up to isomorphisms. 

In this section we have built a family of boxes and triangles from a fixed automaton 
A. This construction leads us to the definition of the unfolding of A as follows. 

Definition 2.3. The unfolding A\j n f of the automaton A = (Q, i, E, — F) is the 
box 13 z it ; moreover /?u n f denote the mapping [3s,tfrom the states ofA\j n f to Q. 

In the next section we study some complexity, structural, and semantical properties of 
this object. We assume that A satisfies Property ID of Definition 1.1 so that it accepts 
a regular trace language L. We explain how to build from the unfolding A\j n f a non- 
deterministic asynchronous automaton that accepts L(A). 
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BUILD-B0X(T°,g°) 



1 ifT° = 

2 then This is the base case 

3 return (BASE-Box(g°)) 

4 if T° is connected (and non-empty) 

5 then 

6 (2, (3) <- EMPTY-BOX Initialise (2, /?) to be 

7 fe <— the special empty box 

8 m <- Max-Out-Degree(T°) + 1 

9 for q e Q starting with q° 

10 do (T, t) <- Build-Triangle(T°, q) Compute A T °, q 

1 1 for I <— 1 to m 

12 do fc <— k + 1 Insert m copies 

13 (T, t') <— Mark((T, r), T°, g, fc) Marked with T°, g, fc 

14 lNSERT((B,/3), (T',t')) 

15 forg,g'eQ 

16 do M <— MlSSING(T°, g, g') List of missing transitions 

17 / < — Min-Rank(T° , g, 25, fc) — 1 Minimal rank of At° , q 

18 /' <— Min-Rank(T°, g', B, fc) — 1 Minimal rank of A T o_ q / 

19 for j <— 1 torn 

20 do c <- We have \M\ + 1 ^ m 

21 for (w, a) £ M 

22 do c <- c + 1 If g = g' then / = /' 

23 if/ + j = /' + c 

24 then c <— c + 1 We have c < m 

25 ADD((S, /3), ((to, T°, g, / + j), a, (i A , T o , T°, g', /' + c))) 

26 Clean(S,/3) 

27 return (S,/3) 

28 if T° is not connected (nor empty) 

29 then 

30 T «- Decomposition(T°) 

31 T 2 *~T°\T 1 

32 (S , /3o) <- Build-Box(T 2 , g°) 

33 (2,/3) ^MARK((S ,/3o),T 2 ,g°,l) 

34 k <- 1 

35 for to G Qn,r 2 ,q° ,q' £ Q and a G Ti 

36 doif/3 (w) g' 

37 then fc <— fc + 1 Insert a copy of D Tl _ q / 

38 CBV) <-BuiLD-Box(Ti,g') 

39 (B", 0") «- MARK((S', f3'),T u q', k) 

40 Insert((B,/3),(S",/3")) 

41 Add((S, /?), ((to, T 2 , g°, 1), a, (t DjTli ,, , Ti,g', fc))) 

42 return (B,/3) 



N.B. 

- In Line 25 «a,t° ,g' denotes the initial state of A T o q i . 

- In Line 35 Qa,T 2 ,q° denotes the set of states of T2 q o. 

- In Line 41 i u Tl q / denotes the initial state of D Tl q / . 



Alg. 5. Construction of a box 
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3 Properties of the unfolding algorithm 

In this section we fix a regular trace language L over the independence alphabet (£, || ). 
We assume that the possibly non-deterministic automaton A fulfills Property ID of 
Def. 1.1 and satisfies L(A) = L. First we sketch a complexity analysis of the num- 
ber of states in the unfolding A\j n i- Next we show in Subsection 3.2 how to build from 
A\jni an asynchronous automaton whose global automaton accepts L{A). 

3.1 Complexity analysis 

For all naturals ti ) Owe denote by (i n the maximal number of states in a box ¥>T, q 
with \T\ = n and q E Q. Similarly for all naturals n ^ 1 we denote by t„ the maximal 
number of states in a triangle Tr,q with |T| = n and q G Q. Noteworthy (3q = 1 and 
n = 1. Moreover r„ is non-decreasing because the triangle At', 9 is a subautomaton 
of the triangle At, 9 as soon as T' C T. In the following we assume 2 ^ n < 

Consider some subset T C S with \T\ = n. Each triangle T^T,q is built inductively 
upon boxes of height h < n — 1 (see Alg. 1). We distinguish two cases. First boxes of 
height h < n — 1 are inserted. Each of these boxes appears also in some triangle 
with fcT and \T'\ = n — 1. Each of these triangles is a subautomaton of , q with 
at most t„_i states. Moreover there are only n such triangles which give rise to at most 
n.r„_i states built along this first step. Second, boxes of height n — 1 are inserted and 
connected to states inserted at height n — 2. Each of these states belongs to some box 
with \T'\ = n — 2; it gives rise to at most 2.\Q\ boxes at height n — 1 because 
|T \ T'| = 2: This yields at most 2.\Q\./3 n - 1 new states. Altogether we get 

r n ^ n.r„_i.(l + 2.|Q|./3„_i) |27|.t„_i.3.|Q|A-i (1) 

Consider now a connected subset T C Z 1 with T| = n — 1. Then each box 25t,<j 
is built upon triangles Tt^' of height n — 1 (see Alg. 5). We can check that the value 
m =Max-Out-Degree(T) is at most r n _i + 1. Therefore the box 23t,«j contains at 
most 2.t„_i copies of each triangle Tt^'- Hence (i n -\ ^ 2.\Q\.t%_ 1 . 

Consider now a non-connected subset T C S with \T\ = n — 1. Then each box 
S-r^ is built upon copies of boxes "Bt 1 ^' where T' is a connected component of (T, }f). 
These boxes are inserted inductively along recursive calls of BuiLD-Box and they are 
connected in a tree-like structure. Each of these boxes contains at most 2.\Q\.t%_ 2 states 
as explained above. From each state of these boxes at most (n — 2).\Q\ new boxes are 
connected. Thus each box Hr'.q' is connected to at most c = \S\.2.\Q\ 2 .t^_ 2 boxes 
in the tree-like structure. Consequently there are at most 1 + c + c 2 + c 3 + ... + c™~ 2 
boxes. It follows that 

/?„_! < c n -\2.\Q\.rl_ 2 < 2 n .\S\ n -\\Q\ 2 - n -\r^ 2 (2) 

SinceT„_ 2 s; r„_i we get in both cases /3 n _i 2\ s \.\S\\ s \- 1 .\Q\ 2 -\ s \- 1 .{T n ^ l ) 2 -\ s \. 

We can now apply (1) and get r„ N.t^_ 1 where = 3.2^ .\E\^ .\Q\ 2 -^ and 
d = 2.\S\ + 1. Since t\ = 1, we get t„ ^ N d " . We can apply (2) with n instead of 
n- 1 and get /3„ 2.|g|.Af.(r„) 2 -(™ +1 ) 2. |Q|.A^.A^ d " _1 -( 2 ™+ 2 ). Finally we have 

< 2.\Q\. (3.2l^l.|^|l^l.|Q| 2 -l^l) (2 ^ l+2) '"' e O (\Q\( 2 -^ +2 ^ ,+1 ) (3) 
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3.2 Construction of an asynchronous automaton 

Finally we build from the unfolding A\j n f = (Qunf, «Unf, E, — >Unf , ^Unf) of A an 
asynchronous automaton A\j n { that accepts L(A). We define A\j n f as follows. First 
we put Qk = Qunf for each process k G K. Next the initial state is the li^-tuple 
(iunf , -, «Unf). Moreover for each action a, the pair ((<7fc) feeLoc(a) , (^) feeLoc(a) ) be- 
longs to the transition relation d a if there exist two states q, r € Qunf and a transition 
q — ^->unf r i n the unfolding such that the two following conditions are satisfied: 

- for all k G Loc(a), q k — >unf Q f° r some word u G (E \ Ek)*; 

- for all k € Loc(a), = r; in particular all are equal. 

Finally, a global state (qk)keK l $fi na l if there exists a final state q G Qunf such that for 
all k G K there exists a path qk — ^->unf 9 for some word «e (U\ E k )*. 

THEOREM 3.1. The asynchronous automaton A\j n i satisfies L{A\j n i) = L(A). More- 
over the number of local states Qk in each process is polynomial in \Q\ where \Q\ is the 
number of states in A; more precisely \Q k \ sC 0(\Q\ d ) where d = {2.\E\ + 2)W +1 . 

3.3 Sketch of proof 

By induction on the structure of the unfolding it is not difficult to check the following 
first property (see Appendix A). 

LEMMA 3.2. The mapping /?unf is a morphismfrom the unfolding A\j n f to A. More- 
over for all u G L(A) there exists v G L(A\j n {) such that v ~ u. 

The proof of Theorem 3.1 relies on an intermediate asynchronous automaton .Aunf 
over some extended independence alphabet (E, ||). We consider the alphabet E =_E U 
{(a,k) G E x K | a $ Ek} provided with the independence relation || such that ajffc iff 
a |(6, a |f (6, k) iff a G E k , and (a, k) ][(&, k') iff k = k! for all actions a,b G E and all 
processes k, k! G K. For each process k G K we put ^ = E k U{(a, k) \ a G E\E k }. 
It is easy to check that {Eh) keK is a distribution of (E, ||). Now-Aunf shares withTlunf 
its local states Qk and its initial state. A global state (qk) keK is final if there exists a final 
state q G i*u n f of the unfolding such that q k = q for all k G K. For each action a G E 
its transition relation d a is such that ((<Zfc)fceLoc(a)i (<7fc)fceLoc(a)) S 9 a if there exists a 
transition g -^->unf Q 1 such that qk = q and = for all k G Loc(a). Moreover for 
each internal action (a, k) G E \ E, we put (<?, g') G 9( ,fe) if 9 ""^Unf <?'■ 

We consider the projection morphism p : E — > Z 1 * such that = e, p(u.a) — 
p(u).a if a G E, and p(u.a) — p(u) if a G \ E. It is not difficult to prove that 

L(A\j n f) C p(£(.Aunf)) and p(L(.Aunf )) = i(-Aunf)- These two basic properties do 
not rely on the particular structure of A\j n { : They hold actually for any automaton. 

On the contrary the proof of the next lemma is very technical and tedious and relies 
on the particular construction of boxes and triangles (see Appendix B for some details). 

LEMMA 3.3. For each box U T<q = (HT,q,0T,q) we have p(LCB T , q )) C [L(3 T>9 )]. 
We can now conclude. By Lemma 3.2 we have [L(.Aunf)] = L(A). On the other hand 
the two basic properties show that [L(.Au n f)] C L(A\j n f). Conversely Lemma 3.3 
yields L(JW) - p(L(A Vnf )) C [L(A Vnf )}. Therefore L(A) = L(A^ f ). 
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Conclusion and future work 

We have presented a polynomial algorithm for the construction of non-deterministic 
asynchronous automata from regular trace languages. We have shown that this new un- 
folding method improves the complexity of known techniques in terms of the number of 
local and global states. Several variations of our approach lead to analogous complexity 
results. We have selected here the simplest version to analyse. But it might not be the 
more efficient in practice. 

Interestingly this unfolding method can be adapted to the implementation of any 
globally-cooperative compositional high-level message sequence charts as investigated 
in [5], At present we are developping more involved unfolding techniques in order to 
construct deterministic safe asynchronous automata [13]. We are also investigating a 
possible extension of our unfolding technique to infinite traces [4]. 
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A Proof of Lemma 3.2 

An immediate induction shows that for each box T q = (Sr,q , pT,q) and each triangle 
At, 9 = (Tr,q,TT,q), the mappings /?t i9 and tt, q are morphisms from Hr.q to At, q and 
from 7r, q to Ar, q respectively. In particular L(A\j n {) Q L(A). 

By induction on the size of T we prove that for all paths q — qi of Ar.q there 

exists an equivalent word v! ~ u such that «d,t,<j — — > u is a path of "2>T.q and /3r, q {v) = 
qi. This property is trivial for the empty set T = because ,A0 )9 and ®0 >9 are reduced 
to the state q. We shall distinguish two cases whether T is connected or not. 

Assume first that T is a connected set of actions. We proceed by induction on the 
length of u. The property holds for the empty word because 0T,q(in,T,q) — q- Let 

q — qi -^-> q 2 be a path of Ar.q- By induction there is u' ~ u such that «n,T, q — — » w 
is a path in and (h,q(v) = qi - Then, by construction, the state v — (w, T, q' , k) 
comes from some triangle Ax. 9 ' • Furthermore w comes from a box U T ,, q ,, inserted in 
Ar i? ': We have to = (w",T", q" , k"). We distinguish several cases. 

1. If a G T \ T" and |T \ T"| = 1. Then (to, a) belongs to Missing(T, <?', q 2 ). 
Consequently Line 25 of Alg. 5 shows that u — ^ («a,t,<j 2 j ^ <Z2, for some 
integer fe' and 0T, g (iA,T,q 2 , qr 2 , = t t , 92 («A,T,g 2 ) = 92- 

2. If a G T \ T" and |T \ T"| ^ 1. Then Line 13 of Alg. 1 shows that w to' with 
w' = («n.T"u{a}.g 2 : ^" u { a }> 92, k') for some integer fc' is a transition of 7r ;9 ' 
and tt,<j< («/) = 92- Consequently v («/, T, g', fc) is a transition of 2$T,g and 
f3 T ,q(w',T,q',k) = q 2 (see Line 14 of Alg. 5). 

3. If a 6 T". By construction the path %n,T,q v of 3t, 9 consists of the sequence 
of transitions io,T,q — ^ v\ v such that u\.u 2 = u', v\ = (wi,T,q',k), 
w\ — (^□,T",g" ,T" , g", fc") and all states v 2 reach along the path v\ v come 
from the same box U T ,, q ,, of the same triangle A^,/ that is v 2 is some tuple 
{{w 2l T" ', q", k"),T, q', k). Consequently each action 6 that occurs in u 2 belongs 
to T": It follows that q" q\ <j 2 is a path of .At",<j"- By induction there 

u' 

is an equivalent word u' 2 ~ u 2 .a such that in,T", q >> w' is a path of 23t",<j" 

and /3t", 9 " (V) = 172- Consequently ui ((«/, T", q", k"), T, q', k) is a path of 
T, T , q and p T , g ((w',T", q", k"),T, q', k) = q 2 (see Line 12 of Alg. 1 and Line 14 
of Alg. 5). 

Suppose now that T is an unconnected set of actions. Let q — q\ be a path of 
•A-T,q, Ti be the connected component that contains the least action of T and T 2 = 
T \ Ti. If u\T\ = e then g — ^ q\ is also a path of Ar 2 , q - Consequently, by induction 

there exists u' ~ u such that iu,T 2 , q u> is a path of ®t 2 ,q and (3r 2 ,q{w) = gi. 
It follows by Line 33 of Alg. 5 that in,T,q ( w ,T 2 ,q,l) is a path of H>T, q and 
PT, q (w,T 2 ,q,l) = qi. If u\Ti = a. Mi and u|T 2 = u 2 then q q 2 g 3 171 
is also a path of Ar, q because u 2 .a.u\ ~ u and yi satisfies ID. Moreover q ^> g 2 is 
a path of Ar 2 , q and q 3 q\ is a path of Ar im - Consequently, by induction, there 

exists u' 2 ~ u 2 such that in,T 2 ,q w 2 is a path of ®T 2 , g and /3t 2i9 (w 2 ) = <? 2 , and 
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on other hand, there exists u[ ~ u\ such that in ,Ti,g 3 — ^ i"i is a path of "Bti,^ and 

/3ti, 93 (wi) = 9i. Then Alg. 5 ensures that in ,T,q — ^ (w2,T 2 ,q,l) is a path of 23t,<j, 

pT,q{w2,T 2 ,q, 1) = g 2 (Line 33), (i n ,T u q 3 ,T l7 q 3} k) (toi, Ti, g 3 , fc) is a path 
of 23^,9 for some integer fc and ^T,q(wi,Ti,qs,k) = q\ (Line 40). Finally we have 

(w2,T 2 ,q, 1) («n,Ti,g 3 ,7i,g3,fe) (Line 35 and 41). It follows that in,r, 9 " 2 -^' 1 
(wi, T\, g 3 , k) is a path of St^, /3t, 9 (wi, Ti, (73, £;) = gi and ul 2 .a.v! x ~ u. 

B Proof sketch of Lemma 3.3 

The complete proof of Lemma 3.3 requires about 20 pages of tedious technical details. 
In this appendix we present the two main ideas that lead the argument. We need first to 
introduce some basic definitions and notations precisely. 

Some basic definitions and notations. Let A be some automaton over S. A path of 
length n e N \ {0} is a sequence of transitions ( qi -^-> q[ ) such that q\ = q i+ i 

V / »6[l,n] 

for all integers < i < n. For all words u 6 S* we write q — g' to denote a path 
I qi q'i ) where q\ = q, q: ' = q', and u = a\...a n . Then q is called the domain 

V /*€[l,n] 

of g g' and g' is called its codomain. A path of length is simply a state q of A. Its 
domain and codomain are equal to q. 

If s and s' are two paths such that the codomain of s is the domain of s' then the 
product s ■ s' is defined in a natural way: If the length of s is then s ■ s' = s'; if the 
length of s' is then s ■ s' = s; otherwise s ■ s' is the concatenation of s and s'. 

Note that if s is a path of the length I > then it is the product of two paths 
s = si ■ s 2 where the length of s\ is 1 and the length of s 2 is I — 1. Moreover such a 
product is unique. This remark allows us to define mappings for paths inductively on 
the length. 

Projections of global states and executions. Assume now that A is (the global sys- 
tem of) an asynchronous automaton over the distribution {S k ) keK . Then a path of 
A is called an execution. For convenience we shall consider the component automata 

(A k ) 

keK defined as follows: For each process j e K, Aj — {Qj,ij, Sj, y j,Qj) 
where q 3 gj if there are q = {q k ) keLoc{a) and q = (q' k ) feeLoc(o) such that 

(q, q') S d a . Note here that j e Loc(a) since a € Sj. 

Now the projection s\k of an execution s of A onto a process j 6 if is a path of Aj 
defined inductively as follows: 

- s\j = qj if s is a path of length that corresponds to the global state (q k ) k eK', 

~ s \j = 1j Qj ' ( s '\j) ^ s i s tne product s = t ■ s' where t is a transition 
(qk)keK W k )keK and j e Loc(a). 

- s|j = s'|j if s is the product s = t ■ s' where i is a transition g g' and 
j £ Loc(a). 
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Executions of extended asynchronous automata. In the paper we define the extended 
asynchronous automaton A\j n f of the unfolding automaton .Aunf- This definition can 
naturally be generalized to any automaton. Let A be an automaton and A the corre- 
sponding extended asynchronous automaton. We say that an execution s = q — ^ q' of 
A is arched if there are two states v and v' in A such that for all k e K, q\k = v and 
q'\k = v'. Noteworthy each execution that leads an extended asynchronous automaton 
A from its global initial state to some global final state is arched. 

We define now a function 7 that associates each action of S to the corresponding 
action of £ in a natural way: For all actions (a, k) e S \ S, j(a, k) = a and for all 
actions a e S, 7(a) = a. As usual this map extends from actions to words and we 
get 7 : E -> S\ We can also extend the mapping 7 as a function from paths of 
component automata Ak to paths of A as follows. For each sequence s that is a path of 
some Ak, we define j(s) inductively on the length of s by 

- 7(s) = q if the length of s is and s = q. 

- 7(s) = q ^—l q' ■ 7(s') if s is a product s = t ■ s' where t is a transition q — —> q' . 
Clearly if s is an execution of A and k a process of K then s\k is a path of A^ and 
j(s\k) is a path of A. 

Definitions associated to unfoldings. Let T be a non-empty subset of S. We consider 
the triangle T T ^ q = (<3a,t, 9 , «a,t, 9 , — >a,t, q , ^a,t, 9 )- Let w be a state from 7^,. By 
construction of Tr. q , v is a quadruple (u>, T", g', k') such that w is a state from the box 
□t',<j' and k' E N. We say that the box location of v is Z D (w) = (T', </, fc'). We define 
the sequence of boxes reached along a path s = q — q' in 7^ as follows: 

- If the length of s is and s corresponds to state q e Qa.t.ij then L D (s) = l a (q). 

- If s is a product s = s' ■ t where t is the transition q q 1 then two cases appear: 

• If l D (q) = l D (q') then L D (s) = L D (s'); 

• If l D (q) ^l D (q') then h n (s) =h D (s').l D (q') 

Similarly we define the sequence of triangles L A (si) reached by a path s\ in a box 
^Ti where Ti is a non-empty connected set of actions and the sequence of boxes 
L D (s 2 ) reached by a path s 2 in a box "BT 2 ,q 2 wnere T 2 is an unconnected set of actions. 

7vvo main properties of unfoldings. The following proposition states that all processes 
behave similarly in an extended asynchronous automaton built from boxes or triangles. 

PROPOSITION B. 1 . Let 23^ m be a box with T\ a non-empty connected set of actions, 
r Br 2 ,q2 k> e a box with T 2 an unconnected set of actions, and 7r 3 ,q 3 be a triangle with 
T3 a non-empty set of actions. Let S\, s 2 and S3 be arched executions ofE^.q^ < Bt 2 ,<}2 
andTT 3 .q 3 respectively. Then: 

1. Vk,k'e Loc(Ti), L A (7( Sl |/c)) = L A ( 7 (si|fc')); 

2. VM' G ^,L n (7(s 2 |fc)) =L D ( 7 ( S2 |fc')); 

3. Vfc, k' G K, L n ( 7 ( S3 |fc)) = L D ( 7 ( S3 |A:'))- 

Proof. Property 2 and Property 3 stem from the remark that 23t 2 ,(}2 an d ^3,93 
made of boxes connected along a tree-like structure. The proof of Property 1 is more 
subtle. Let a be an action of T\ and k, k' be two processes of Loc(a). We proceed 
by contradiction. Let T and T be the first triangles that differ in h A ("f(si\k)) and 
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L A (7(si|fc')). Let cbe the number of a-transitions that occur in s\ just before 7(^1 |fc) 
and 7(si|fe') reach T and T . Since s\ is arched, 7(si|fc) and 7(si|fc') have to meet 
eventually for the last state. Therefore 7(si|fc) and 7(si|fc') have to leave triangles T 
and T respectively. Consequently, there is a (c + l) th a-transition q q' in si. 
Moreover, this transition is such that q\k is a state from T whereas q\k' is a state from 
T', that is: q\k ^ q\k'. This contradicts the definition of d a . ■ 

PROPOSITION B.2. Let Tir.q be a box. Let s = q — q' be an arched execution of 
H>T,q with q\k — w and q'\k = w' for all k G K. Then there is a word v G S* such 
that v ~ p(u) and w — u>' « a /?af/z ofS>T,q- 

Proof. We proceed by induction on the size of T. The case where T = is trivial 
because Hq^ consists of a single state q. Suppose that the property holds for all subsets 
T' C T and all states q' G Q. Assume first that T is a connected set of actions. By 
Proposition B.l, we know that L A (7(s|fc)) = L A (7(s|A:')) for all processes k, k! G 

Loc(Ti). We claim first that we can find an other execution s' = q q' such that for 
all processes k,k' G K, L A (7(s'|fc)) = L A (7(s'|fc')) and moreover p(u) — p(u'). Let 
L A (7(s'|fc)) = Ti...T n be the sequence of triangles visited by 7(5' |fc). We can split the 
execution s' into several smaller arched executions si, . . . , s n such that each execution 
Si is located within triangle %. Similarly each execution Si can be split into several 
smaller arched executions s\, s' m such that each execution s'j is located within a box 
23 j inserted in %. Then we can conclude by applying the inductive hypothesis on each 
smaller box. 

Assume finally that T is an unconnected set of actions. By Proposition B. 1, we know 
that for all processes k, k' G K, L D (7(s|fc)) = L D (j(s\k' j). Then we can conclude by 
applying the inductive hypothesis on the smaller boxes visited by s. ■ 

Lemma 3.3 follows now immediately: We have p(L(23t, 9 )) Q [L(23t,<j)]- 



